Can latent speech quality dimensions be quantified directly?
نویسندگان
چکیده
Previous studies revealed that the quality of transmitted speech can well be described by three perceptual dimensions: “discontinuity”, “noisiness”, and “coloration”. In this paper, a method is presented for quantifying these dimensions directly in subjective tests. Two experiments were conducted according to this method with large sets of test conditions. A detailed description of the test procedure and a thorough analysis of the results is presented. The proposed method is reliable and produces meaningful and orthogonal results for diverse stimuli sets, in particular for stimuli containing multi-dimensional degradations. The method might form the basis for collecting the data for future diagnostic speech quality estimators.
منابع مشابه
Principles for Learning Controllable TTS from Annotated and Latent Variation
For building flexible and appealing high-quality speech synthesisers, it is desirable to be able to accommodate and reproduce fine variations in vocal expression present in natural speech. Synthesisers can enable control over such output properties by adding adjustable control parameters in parallel to their text input. If not annotated in training data, the values of these control inputs can b...
متن کاملUsing Bacillus Cereus as a Geo-Biological Marker For Gold Prospecting in Iran
Several methods have been developed for gold exploration in the past, among which biological base method is known to be the most efficient with least expenses. This method can also be used for latent gold prospects exploration. In the present study, the possibility of applying Bacillus cereus frequency in soil as a biological marker was investigated for the exploration of latent gold prospectin...
متن کاملModeling of Integral Quality Based on Perceptual Dimensions - A Framework for a New Instrumental Speech-Quality Measure
In this contribution, the general framework for a new instrumental measure for end-to-end speech transmission quality is described. It is based on the notion that integral quality can be described by the global perceptual dimensions “discontinuity”, “noisiness”, and “coloration”. The dimensions were identified through multidimensional analyses of telephone speech quality in an end-to-end contex...
متن کاملAnalyzing Technical Causes and Perceptual Dimensions for Diagnosing the Quality of Transmitted Speech
We present an analysis of technical causes and corresponding perceptual dimensions of the quality of transmitted speech. Four experts annotated speech files of a common database according to a methodology which is currently being discussed for the future ITU-T Recommendation P.TCA. The annotations are analyzed with respect to their frequency and consistency, and compared to overall quality valu...
متن کاملIs intelligibility still the main problem? a review of perceptual quality dimensions of synthetic speech
In this paper, we present a comparative overview of 9 studies on perceptual quality dimensions of synthetic speech. Different subjective assessment techniques have been used to evaluate the text-to-speech (TTS) stimuli in each of these tests: in a semantic differential, the test participants rate every stimulus on a given set of rating scales, while in a paired comparison test, the subjects rat...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2017